Graph Based Local Recoding for Data Anonymization
نویسندگان
چکیده
Releasing person specific data could potentially reveal the sensitive information of an individual. kanonymity is an approach for protecting the individual privacy where the data is formed into set of equivalence classes in which each class share the same values. Among several methods, local recoding based generalization is an effective method to accomplish k-anonymization. In this paper, we proposed a minimum spanning tree partitioning based approach to achieve local recoding. We achieve it in two phases. During the first phase, MST is constructed using concept hierarchical and the distances among data points are considered as the weights of MST and in the next phase we generate the equivalence classes adhering to the anonymity requirement. Experiments show that our proposed local recoding framework produces better quality in published tables than existing Mondrian global recoding and k-member clustering approaches.
منابع مشابه
An Effective Method for Utility Preserving Social Network Graph Anonymization Based on Mathematical Modeling
In recent years, privacy concerns about social network graph data publishing has increased due to the widespread use of such data for research purposes. This paper addresses the problem of identity disclosure risk of a node assuming that the adversary identifies one of its immediate neighbors in the published data. The related anonymity level of a graph is formulated and a mathematical model is...
متن کاملAn Local-recoding Anonymization with Mapreduce for Scalable Big Data Privacy Preservation in Cloud
Data privacy preservation is one of the most disturbed issues on the current industry. Data privacy issues need to be addressed urgently before data sets are shared on cloud. Data anonymization refers to as hiding complex data for owners of data records. Cloud computing provides promising scalable IT infrastructure to support various processing of a variety of big data applications in sectors s...
متن کاملBottom-Up Cell Suppression that Preserves the Missing-at-random Condition
This paper proposes a cell-suppression based k-anonymization method which keeps minimal the loss of utility. The proposed method uses the Kullback-Leibler (KL) divergence as a utility measure derived from the notions developed in the literature of incomplete data analysis, including the missing-at-random (MAR) condition. To be more specific, we plug the KL divergence into an bottom-up, greedy p...
متن کاملLightning: Utility-Driven Anonymization of High-Dimensional Data
The ARX Data Anonymization Tool is a software for privacy-preserving microdata publishing. It implements methods of statistical disclosure control and supports a wide variety of privacy models, which are used to specify disclosure risk thresholds. Data is mainly transformed with a combination of two methods: (1) global recoding with full-domain generalization of attribute values followed by (2)...
متن کاملLocal recoding by maximum weight matching for disclosure control of microdata sets
We propose “local recoding” as a new technique for controlling disclosure risk of microdata sets. Compared to the technique of global recoding, where the observed values are grouped into broader intervals or categories throughout the data set, in local recoding different grouping is performed for each observation when necessary. As a means of performing local recoding we propose to form pairs o...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013